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Mouse and human utrophin gene promoters are provided. The promoters fragments and derivatives may be used to control 
transcription of heterologous sequences, including coding sequences of reporter genes. Expression systems such as host cells containing 
nucleic acid constructs which comprise a promoter as provided operably linked to a heterologous sequence may be used to screen substances 
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Utrophin gene promoter 

The present invention is based on cloning of a 
genomic promoter region of the human utrophin gene and 
of the mouse utrophin gene. 
5 The severe muscle wasting disorders Duchenne 

muscular dystrophy (DMD) and the less debilitating 
Becker muscular dystrophy <BMD) are due to mutations in 
the dystrophin gene resulting in a lack of dystrophin 
or abnormal expression of truncated forms of 
10 dystrophin, respectively. Dystrophin is a large 

cytoskeletal protein (427kS)a with a length of 125nm) 
which in muscle is located at the cytoplasmic surface 
of the sarcolemma, the neuromuscular junction (NMJ) and 
myotendinous junction (MTJ) . It binds to a complex of 
15 proteins and glycoproteins spanning the sarcolemma 

called the dystrophin associated glycoprotein complex 
(DGC) . The breakdown of the integrity of this complex 
due to loss of , or impairment of dystrophin function, 
leads to muscle degeneration and the DMD phenotype . 
20 The dystrophin gene is the largest gene so far 

identified in man, covering over 2.7 megabases and 
containing 79 exons. The corresponding 14kb dystrophin 
mRNA is expressed predominantly in skeletal, cardiac 
and smooth muscle with lower levels in brain. 
25 Transcription of dystrophin in different tissues is 

regulated from either the brain promoter (predominantly 
active in neuronal cells) or muscle promoter 
(differentiated myogenic cells, and primary glial 
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cells) giving rise to differing first exons. A third 
promoter between the muscle promoter and the second 
exon of dystrophin regulates expression in cerebellar 
Purkinje neurons. Recently reviewed in [1,2,3]. 
5 There are various approaches which have been 

adopted for the gene therapy of DMD, using the mdx 
mouse as a model system. However, there are 
considerable problems related to the number of muscle 
cells that can be made dystrophin positive, the levels 
10 of expression of the gene and the duration of 

expression [4] . It has also become apparent that 
simply re- introducing genes expressing the dystrophin 
carboxy- terminus has no effect on the dystrophic 
phenotype although the DGC appears to be re-established 
15 at the sarcolerama [5,6]. 

In order to circumvent some of these problems, 
possibilities of compensating for dystrophin loss using 
a related protein, utrophin, are being explored as an 
alternative route to dystrophin gene therapy. A 
20 similar strategy is currently being evaluated in 

clinical trials to upregulate foetal haemoglobin to 
compensate for the affected adult-globin chains in 
patients with sickle cell anaemia [7,8]. 

Utrophin is a 395JcDa protein encoded by 
25 multiexonic 1Mb UTRN gene located on chromosome 6q24 
[9] . At present the tissue regulation of utrophin is 
not fully understood. Unlike dystrophin, only one 
promoter has so far been detected. In the dystrophin 
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deficient mdx mouse, utrophin levels in muscle remain 
elevated soon after birth compared with normal mice; 
once the utrophin levels have decreased to the adult 
levels {about 1 week after birth) , the first signs of 
5 muscle fibre necrosis are detected. However there is 
evidence to suggest that in the small calibre muscles, 
continual increased levels of utrophin can interact 
with the DGC complex (or an antigenically related 
complex) at the sarcolemma thus preventing loss of the 
10 complex with the result that these muscles appear 

normal. There is also a substantial body of evidence 
demonstrating that utrophin is capable of localising to 
the sarcolemma in normal muscle. During fetal muscle 
development there is increased utrophin expression, 
15 localised to the sarcolemma, up until 18 weeks in the 
human and 20 days gestation in the mouse. After this 
time the utrophin sarcolemmal staining steadily 
decreases to the significantly lower adult levels 
shortly before birth where utrophin is localised almost 
20 exclusively to the NMJ. The decrease in utrophin 
expression coincides with increased expression of 
dystrophin . See reviews [1,2,3] . 

Thus, in certain circumstances utrophin can 
localise to the sarcolemma probably at the same binding 
25 sites as dystrophin, through interactions with actin 

and the DGC. Accordingly, if expression of utrophin is 
sufficiently elevated, it may maintain the DGC and thus 
alleviate muscle degeneration in DMD/BMD patients til] . 
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However, manipulation of utrophin expression and 
screening for molecules able to upregulate expression 
is hampered by the utrophin promoter not having been 
cloned. Though a cDNA encoding sequence for utrophin 
5 is available, the sequence of the upstream untranslated 
region is not and, given the large number of introns in 
the utrophin gene [9] , it is not surprising that there 
has been no reported successful identification and 
isolation of the utrophin promoter. 
10 We have now cloned a utrophin promoter and the 

present invention in various aspects and embodiments is 
based on the sequence information obtained and provided 
herein . 

One major use of the promoter is in screening for 
15 substances able to modulate its activity. It is well 
known that pharmaceutical research leading to the 
identification of a new drug generally involves the 
screening of very large numbers of candidate 
substances, both before and even after a lead compound 
20 has been found. This is one factor which makes 
pharmaceutical research very expensive and time- 
consuming. A method or means assisting in the 
screening process will have considerable commercial 
importance and utility. Substances identified as 
25 upregulators of the utrophin promoter represent an 

advance in the fight against muscular dystrophy since 
they provide basis for design and investigation of 
therapeutics for in vivo use. 
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According to a first aspect, the present invention 
provides a nucleic acid isolate comprising a utrophin 

gene promoter. 

In a second aspect, the present invention provides 
5 a nucleic acid isolate comprising a promoter, the 

promoter comprising a sequence of nucleotides shown in 
Figure 2 or Figure 5. The promoter may comprise one or 
more fragments of the sequence shown in Figure 2 or 
Figure 5, sufficient to promote gene expression. The 
10 promoter may comprise or consist essentially of a 

sequence of nucleotides 5' to position 898 in Figure 2 
or Figure 5 r for the human, or an equivalent sequence 
in the mouse. Preferably such a promoter comprises or 
consists essentially of nucleotides numbered 746 - 944 
15 in the sequence shown in Figure 2 or the human sequence 
of Figure 5, or the equivalent mouse sequence of Figure 
5. An even smaller portion of this part of the 
sequence of Figure 2 or Figure 5 may be used as long as 
promoter activity is retained. Restriction enzyme or 
20 nucleases may be used to digest the nucleic 

followed by an appropriate assay (for example as 
illustrated herein using luciferase constructs) to 
determine the minimal sequence required. A preferred 
embodiment of the present invention provides a nucleic 
25 acid isolate with the minimal nucleotide sequence shown 
in Figure 2 or Figure 5 required for promoter activity. 
The minimal promoter element is situated between the 
Smal and EagI restriction sites located at 746 and 
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942bp respectively in the human sequence and 1486 and 
1645bp respectively in the mouse sequence (see Figure 

5) . 

The promoter may comprise one or more sequence 
5 motifs or elements conferring developmental and/or 

tissue- specific regulatory control of expression. For 
instance, the promoter may comprise a sequence for 
muscle- specific expression, e.g. an E-box element/myoD 
binding site, such as CANNTG, preferably CAGGTG. This 
10 may be upstream (5') of a promoter sequence as 

discussed above, e.g. at nucleotides 498-503 using the 
numbering of Figure 2. The promoter may comprise a 
sequence for synaptic expression, e.g. TTCCGG, which 
may be upstream (5') of a promoter sequence as 
15 discussed above, e.g. at nucleotides 591-596 using the 
numbering of Figure 2. Other regulatory sequences may 
be included, for instance as identified by mutation or 
digest assay in an appropriate expression system or by 
sequence comparison with available information, e.g. 
20 using a computer to search on-line databases. 

By "promoter 11 is meant a sequence of nucleotides 
from which transcription may be initiated of DNA 
operably linked downstream (i.e. in the 3' direction on 
the sense strand of double- stranded DNA) . 
25 "Operably linked" means joined as part of the same 

nucleic acid molecule, suitably positioned and oriented 
for transcription to be initiated from the promoter. 
DNA operably linked to a promoter is "under 
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transcriptional initiation regulation" of the promoter. 

The present invention extends to a promoter which 
has a nucleotide sequence which is allele, mutant, 
variant or derivative, by way of nucleotide addition, 

5 insertion, substitution or deletion of a promoter 
sequence as provided herein. Systematic or random 
mutagenesis of nucleic acid to make an alteration to 
the nucleotide sequence may be performed using any 
technique known to those skilled in the art. One or 

10 more alterations to a promoter sequence according to 

the present invention may increase or decrease promoter 
activity, or increase or decrease the magnitude of the 
effect of a substance able to modulate the promoter 
activity. 

15 "Promoter activity" is used to refer to ability to 

initiate transcription. The level of promoter activity 
is quantifiable for instance by assessment of the 
amount of mRNA produced by transcription from the 
promoter or by assessment of the amount of protein 

20 product produced by translation of mRNA produced by 
transcription from the promoter. The amount of a 
specific mRNA present in an expression system may be 
determined for example using specific oligonucleotides 
which are able to hybridise with the mRNA and which are 

25 labelled or may be used in a specific amplification 

reaction such as the polymerase chain reaction. Use of 
a reporter gene" as discussed further below facilitates 
determination of promoter activity by reference to 
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protein production. 

In various embodiments of the present invention a 
promoter which has a sequence that is a fragment, 
mutant, allele, derivative or variant, by way of 
5 addition, insertion, deletion or substitution of one or 
more nucleotides, of the sequence of the promoter shown 
in Figure 2 or the mouse promoter shown in Figure 5, 
has at least about 60* homology with one or both of the 
shown sequences, preferably at least about 70% 
10 homology, more preferably at least about 80* homology, 
more preferably at least about 90* homology, more 
preferably at least about 95* homology. The sequence 
in accordance with an embodiment of the invention may 
hybridise with one or both of the shown sequences, or 
15 the complementary sequences (since DNA is generally 

double -stranded) . The sequence may have the ability to 
promote transcription (i.e. have "promoter activity") 
in muscle cells, e.g. human muscle cells, or muscle - 
specific transcription. 
20 Further provided by the present invention is a 

nucleic acid construct comprising a utrophin promoter 
region or a fragment, mutant, allele, derivative or 
variant thereof able to promoter transcription, 
operably linked to a heterologous gene, e.g. a coding 
25 sequence. By " heterologous" is meant a gene other than 
utrophin. Modified forms of utrophin are generally 
excluded. Generally, the gene may be transcribed into 
mRNA which may be translated into a peptide or 
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polypeptide product which may be detected and 
preferably quantitated following expression. A gene 
whose encoded product may be assayed following 

» 

expression is termed a "reporter gene", i.e. a gene 
5 which "reports" on promoter activity. 

The reporter gene preferably encodes an enzyme 
which catalyses a reaction which produces a detectable 
signal, preferably a visually detectable signal, such 
as a coloured product. Many examples are known, 

10 including 0-galactosidase and lucif erase. 0- 

galactosidase activity may be assayed by production of 
blue colour on substrate, the assay being by eye or by 
use of a spectrophotometer to measure absorbance. 
Fluorescence, for example that produced as a result of 

15 luciferase activity, may be quantitated using a 

spectrophotometer. Radioactive assays may be used, for 
instance using chloramphenicol acetyltransf erase, which 
may also be used in non- radioactive assays. The 
presence and/or amount of gene product resulting from 

20 expression from the reporter gene may be determined 

using a molecule able to bind the product, such as an 
antibody or fragment thereof. The binding molecule may 
be labelled directly or indirectly using any standard 
technique . 

25 Those skilled in the art are well aware of a 

multitude of possible reporter genes and assay 
techniques which may be used to determine gene 
activity. Any suitable reporter/assay may be used and 



WO 96/34101 PCT/CB96/00979 

10 

it should be appreciated that no particular choice is 
essential to or a limitation of the present invention. 

Expression of a reporter gene from the promoter 
may be in an in vitro expression. system or may be 
5 intracellular (in vivo) . Expression generally requires 
the presence, in addition to the promoter which 
initiates transcription, a translational initiation 
region and transcriptional and translational 
termination regions* One or more introns may be 

10 present in the gene, along with mRNA processing signals 
(e.g. splice sites) . 

Systems for cloning and expression of a 
polypeptide in a variety of different host cells are 
well known. Suitable host cells generally include 

15 bacteria, mammalian cells, yeast and baculovirus 

systems. Mammalian cells are preferred for use in the 
present invention. Mammalian cell lines available in 
the art for expression of a heterologous polypeptide 
include Chinese hamster ovary cells, HeLa cells, baby 

20 hamster kidney cells, human glioma cells, and many 

others, including other human cell lines. A common, 
preferred bacterial host is E. coli. 

The present invention also provides a nucleic acid 
vector comprising a promoter as disclosed herein. Such 

25 a vector may comprise a suitably positioned restriction 
site or other means for insertion into the vector of a 
sequence heterologous to the promoter to be operably 
linked thereto. 
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Suitable vectors can be chosen or constructed, 
containing appropriate regulatory sequences, including 
promoter sequences, terminator fragments, 
polyadenylation sequences, enhancer sequences, marker 
5 genes and other sequences as appropriate. For further 
details see, for example, Molecular Cloning: a 
Laboratory Manual: 2nd edition, Sarabrook et al, 1989, 
Cold Spring Harbor Laboratory Press. Procedures for 
introducing DNA into cells depend on the host used, but 

10 are well known. 

Thus, a further aspect of the present invention 
provides a host cell containing a nucleic acid 
construct comprising a promoter element, as disclosed 

r- 

herein, operably linked to a heterologous gene. A 
15 still further, aspect provides a method comprising 

introducing such a construct into a host cell. The 
introduction may employ any available technique, 
including, for eukaryotic cells, calcium phosphate 
transfection, DEAE-Dextran transf ection, 
20 electroporation, liposome -mediated transfection and 
transduction using retrovirus. 

The introduction may be followed by causing or 
allowing expression of the heterologous gene under the 
control of the promoter, e.g. by culturing host cells 
25 under conditions for expression of the gene. 

In one embodiment, the construct comprising 
promoter and gene is integrated into the genome (e.g. 
chromosome) of the host cell. Integration may be 
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promoted by inclusion in the construct of sequences 
which promote recombination with the genome, in 
accordance with standard techniques. 

Many known techniques and protocols for 
5 manipulation of nucleic acid, for example in 

preparation of nucleic acid constructs, mutagenesis, 
sequencing, introduction of DNA into cells and gene 
expression, and analysis of proteins, are described in 
detail in Short Protocols in Molecular Biology, Second 
10 Edition, Ausubel et al. eds., John Wiley & Sons, 1992, 
the disclosure of which is incorporated herein by 
reference . 

Nucleic acid molecules, constructs and vectors 
according to the present invention may be provided 

15 isolated and/or purified (i.e. from their natural 
environment) , in substantially pure or homogeneous 
form, free or substantially frae of a utrophin coding 
sequence, or free or substantially free of nucleic acid 
or genes of the species of interest or origin other 

20 than the promoter sequence. Nucleic acid according to 
the present invention may be wholly or partially 
synthetic. The term 11 isolate" encompasses all these 
possibilities . 

Nucleic acid constructs comprising a promoter (as 

25 disclosed herein) and a heterologous gene (reporter) 
may be employed in screening for a substance able to 
modulate activity of the promoter. For therapeutic 
purposes, e.g. for treatment of muscular dystrophy, a 
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substance able to up -regulate expression of the 
promoter may be sought. A method of screening for 
ability of a substance to modulate activity of a 
utrophin promoter may comprise contacting an expression 
5 system, such as a host, cell, containing a nucleic acid 
. construct as herein disclosed with a test or candidate 
substance and determining expression of the 
heterologous gene. 

The level of expression in the presence of the 
10 test substance may be compared with the level of 

expression in the absence of the test substance. A 
difference in expression in the presence of the test 
substance indicates ability of the substance t6 
modulate gene expression. An increase in expression of 
15 the heterologous gene compared with expression of 
another gene not linked to a promoter as disclosed 
herein indicates specificity of the substance for 
modulation of the utrophin promoter. 

A promoter construct may be trans fee ted into a 
20 cell line using any technique previously described to 
produce a stable cell line containing the reporter 
construct integrated into the genome. The cells may be 
grown and incubated with test compounds for varying 
times. The cells may be grown in 96 well plates to 
25 facilitate the analysis of large numbers of compounds. 
The cells may then be washed and the reporter gene 
expression analysed. For some reporters, such as 
lucif erase the cells will be lysed then analysed. 
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Previous experiments testing the effects of 
glucocorticoids on the endogenous utrophin protein and 
RNA levels in myoblasts have already been described 
[12,13] and techniques used for those experiments may 
5 similarly be employed. 

Constructs comprising one or more developmental 
and/or time-specific regulatory motifs (as discussed) 
may be used to screen for a substance able to modulate 
the corresponding aspect of the promoter activity, e.g. 

10 muscle and/or synapse -specific expression. 

Following identification of a substance which 
modulates or affects utrophin promoter activity, the 
substance may be investigated further. Furthermore, it 
may be manufactured and/or used in preparation, i.e. 

15 manufacture or formulation, of a composition such as a 
medicament, pharmaceutical composition or drug. These 
may be administered to individuals. 

Thus, the present invention extends in various 
aspects not only to a substance identified using a 

20 nucleic acid molecule as a modulator of utrophin 

promoter activity, in accordance with what is disclosed 
herein, but also a pharmaceutical composition, 
medicament, drug or other composition comprising such a 
substance, a method comprising administration of such a 

25 composition to a patient, e.g. for increasing utrophin 
expression for instance in treatment of muscular 
dystrophy, use of such a substance in manufacture of a 
composition for administration, e.g. for increasing 
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utrophin expression for instance in treatment of 
muscular dystrophy, and a method of making a 
pharmaceutical composition comprising admixing such a 
substance with a pharmaceutically acceptable excipient, 
5 vehicle or carrier, and optionally other ingredients. 
Administration will preferably be in a 
"therapeutically effective amount", this being 
sufficient to show benefit to a patient. Such benefit 
may be at least amelioration of at least one symptom. 

10 The actual amount administered, and rate and time- 

course of administration, will depend on the nature and 
severity of what is being treated. Prescription of 
treatment, eg decisions on dosage etc, is within the 
responsibility of general practitioners and other 

15 medical doctors. 

A composition may be administered alone or in 
combination with other treatments, either 
simultaneously or sequentially dependent upon the 
condition to be treated. 

20 Pharmaceutical compositions according to the 

present invention, and for use in accordance with the 
present invention, may comprise, in addition to active 
ingredient, a pharmaceutically acceptable excipient, 
carrier, buffer, stabiliser or other materials well 

25 known to those skilled in the art. Such materials 

should be non- toxic and should not interfere with the 
efficacy of the active ingredient. The precise nature 
of the carrier or other material will depend on the 
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route of administration, which may be oral, or by 
injection, e.g. cutaneous, subcutaneous or intravenous. 

Pharmaceutical compositions for oral - 
administration may be in tablet, capsule, powder or 
5 liquid form. A tablet may comprise a solid carrier 

such as gelatin or an adjuvant. Liquid pharmaceutical 
compositions generally comprise a liquid carrier such 
as water, petroleum, animal or vegetable oils, mineral 
oil or synthetic oil. Physiological saline solution, 
10 dextrose or other saccharide solution or glycols such 
as ethylene glycol, propylene glycol or polyethylene 
glycol may be included. 

For intravenous, cutaneous or subcutaneous 
injection, or injection at the site of affliction, the 
15 active ingredient will be in the form of a parenterally 
acceptable aqueous solution which is pyrogen- free and 
has suitable pH, isotonicity and stability. Those of 
relevant skill in the art are well able to prepare 
suitable solutions using, for example, isotonic 
20 vehicles such as Sodium Chloride Injection, Ringer's 

Injection, Lactated Ringer's Injection. Preservatives, 
stabilisers, buffers, antioxidants and/or other 
additives may be included, as required. 

Instead of a substance identified using a promoter 
25 as disclosed herein, a mimetic or mimick or the 

substance may be designed for pharmaceutical use. The 
designing of mimetics to a known pharmaceutical^ 
active compound is a known approach to the development 
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of pharmaceuticals based on a "lead" compound. This 
might be desirable where the active compound is 
difficult or expensive to synthesise or where it is 
unsuitable for a particular method of administration, 
5 eg peptides are unsuitable active agents for oral 
compositions as they tend to be quickly degraded by 
proteases in the alimentary carnal. Mimetic design, 
synthesis and testing may be used to avoid randomly 
screening large number of molecules for a target 
10 property. 

There are several steps commonly taken in the 
design of a mimetic from a compound having a given 
target property. Firstly, the particular parts of the 
compound that are critical and/or important in 
15 determining the target property are determined. In the 
case of a peptide, this can be done by systematically 
varying the amino acid residues in the peptide, eg by 
substituting each residue in turn. These parts or 
residues constituting the active region of the compound 

» 

20 are known as its " pharmacophore w . 

Once the pharmacophore has been found, its 
structure is modelled to according its physical 
properties, eg stereochemistry, bonding, size and/or 
charge, using data from a range of sources, eg 

25 spectroscopic techniques, X-ray diffraction data and 

NMR. Computational analysis, similarity mapping (which 
models the charge and/or volume of a pharmacophore, 
rather than the bonding between atoms) and other 
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techniques can be used in this modelling process . 
In a variant of this approach, the three-dimensional 
structure of the ligand and its binding partner are 
modelled. This can be especially useful where the 
5 ligand and/or binding partner change conformation on 

binding, allowing the model to take account of this the 
design of the mimetic. 

A template molecule is then selected onto which 
chemical groups which mimic the pharmacophore can be 

10 grafted. The template molecule and the chemical groups 
grafted on to it can conveniently be selected so that 
the mimetic is easy to synthesise, is likely to be 
pharmacologically acceptable, and does not degrade in 
vivo, while retaining the biological activity of the 

15 lead compound. The mimetic or mimetics found by this 
approach can then be screened to see whether they have 
the target property, or to what extent they exhibit it. 
Further optimisation or modification can then be 
carried out to arrive at one or more final mimetics for 

20 in vivo or clinical testing. 

Mimetics of substances identified as having 
ability to modulate utrophin promoter activity using a 
screening method as disclosed herein are included 
within the scope of the present invention. 

25 Modifications to and further aspects and 

embodiments of the present invention will be apparent 
to those skilled in the art. All documents mentioned 
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herein are incorporated by reference. 

Experimental basis for and embodiments of the 
present invention will now be described in more detail, 
by way of example and not limitation, and with 
5 reference to the following figures: 

Figure 1 shows overlapping yeast artificial 
chromosomes (YACs) covering the complete genomic region 
of human utrophin. 

Figure 2 shows the nucleic acid sequence of a 

10 124 6bp Hindi I I fragment which comprises a human 
utrophin promoter and first utrophin gene exon, 
including restriction endonuclease sites which may be 
used to derive reporter constructs , and putative 
transcription factor binding sites. 

15 Figure 3 shows a restriction endonuclease map of 

the 1246bp Hindlll fragment of Figure 2, and various 
constructs used in the functional assay of the utrophin 
promoter, each construct comprising a lucif erase gene 
(denoted by the hatched bar) operably linked to a 

20 different fragment of the 1246bp region. H=HindIII; 
X=XhoI ; S=SmaI , P=PstI; E-EagI and n =" refer to 

lucif erase activity. 

Figure 4 shows normalised luciferase activity 
resulting from transcription of the utrophin promoter 

25 constructs shown in Figure 3, expressed as the 

percentage of the activity of the pHH construct. The 
histogram shows the mean of the results from 
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transfection experiments. 

Figure 5 shows sequence and phylogenetic 
conservation of a promoter region of utrophin and 
untranslated exon l. The human sequence is printed 
5 above and the mouse sequence below. The location of 
the major transcription initiation sites in human by 
primer extension assays and 5' RACE analysis are 

by closed T triangles and open 

triangles V. The major cap site found for mouse by 

10 RNase protection is indicated by an open circle O. 

Putative regulatory motifs are boxed. The primers A 
and B used for RT PCR in the mouse are indicated, and 
the splice junction for exon 1 - intron 1 is marked. 



Isolation of a utrophin promoter 

m 

15 The complete genomic region of human utrophin was 

cloned in a series of overlapping yeast artificial 
chromosomes (YACs) as shown in Figure 1. YAC 
4X23E3 (YAC23) was shown to contain only human utrophin 
exons 1 and 2 as well as containing a cluster of rare 

20 cutting enzymes. The corresponding genomic region was 
shown to be unmet hylated [9] . Considering these 

* 

results, YAC23 was thought to contain a utrophin 
promoter. 

YAC23 was randomly subcloned into AGEM-11 to 
25 generated a library. After screening with a 350bp 

SacII fragment containing the most 5' sequences of the 
utrophin cDNA f a number of positive hybridising phage 
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were further subcloned into pUC18 plasmid. One of 
these subclones, 1246bp Hindlll fragment was sequenced 
and shown to contain the putative utrophin promoter and 
first exon. Figure 2 shows this sequence and includes 
5 putative transcription factor binding sites, 

■ 

Reporter Gene Expression 

In order to identify the promoter element of 
utrophin, a series of reporter constructs containing 
different regions of the 5'UTR of the utrophin gene 
10 were assayed for transcriptional activity. 

Fire -fly lucif erase was used as a reporter gene 
and plasmids were constructed by cloning different 
fragments of the 5' UTR of the utrophin gene into a 
polylinker site 5' of the cDNA coding for the firefly 
15 luciferase in the pGL2-Basic vector (Promega) . Human 
cell line (IN157) cells were transfected with 50/ig of 
test plasmid and 10/ig of. an internal control plasmid 
pSV-0-gal (Promega) which contains the /5-galactosidase 
gene driven by the SV40 promoter. Transfection was 
20 performed by electroporation of 3x10* cells, 0.8ml 

volume of 0.4cm 2 cuvettes, at 0.25V and capacitance of 
960/iF. Cells were allowed .to recover on ice for 1 
minute before being plated into 4mls of growth media. 
Cells were harvested at 24 hours after transfection and 
25 were lysed in 400/il of lx Reagent Lysis Buffer 
(Promega) . 

The luciferase activity assay was initiated by 
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mixing 20^1 of cell lysate with lOOjil of luciferase 
assay reagent (Promega) and the light emitted was 
integrated over a 10 second interval on a luminometer 
(Turner design), and expressed as light units. 0- 
5 galactosidase activity was assayed using SOjtl of the 

same cell lysate mixed with an equal volume of 2x Assay 
Reagent (Promega) , incubated 37°C for 3 hours and the 
absorbance at 420nm measured using a spectrophotometer. 
The luciferase activity was normalised to the 0- 

10 galactosidase activity derived from the cot rans feet ed 
internal control plasmid to correct for differences in 
transfection efficiency. 

Figure 3 illustrates the different constructs used 
in the functional assay of the utrophin promoter. In 

15 Figure 4 the normalised luciferase activity resulting 
from transcription of the utrophin promoter constructs 
is expressed as the percentage of the activity of the 
pUPB-LUC construct. The histogram shows the mean of 
the results from three transfection experiments each of 

20 which was performed in triplicate. 

Several constructs display transcriptional 
activity as a result of the presence of the utrophin 
promter. Comparison of the activity between the 
constructs demonstrates that the minimal promoter 

25 element lies with the 200bp Pstl-EagI fragment (Figure 
4) . 

Deletions of other regions within the 5' flanking 
sequence, including the conserved E-box (nucleotides 
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498-503) and TTCCGG motif (nucleotides 591-596), do not 
affect appreciably the activity of the utrophin 
promoter in IN157 cells. However, this could be 
explained by the absence of appropriate transcriptional 
5 factors within the IN157 cells that may be capable of 
interacting with the DNA motifs. 

Analysis of the promoter region 

To delineate DNA elements that may regulate the 
activity of the utrophin gene promoter, the equivalent 

10 mouse promoter was isolated. A mouse genomic phage 
library was screened with the human 0.6kb probe. 
Positive phage were further subcloned into plasmid and 
a 2.2kb EcoRl subclone (M2.2) was sequenced. Figure 5 
shows an alignment between the two corresponding 

15 sequences, displaying limited overall conservation 

across exon 1 of human and mouse, which may reflect the 
lower evolutionary constraints of non-coding sequence. 
The consensus splice site for exon 1 is conserved 
between human and mouse genomic sequence. 

20 The sequence of the first exon and 5' flanking 

sequence has a very high GC content. Database searches 
of the human utrophin promoter sequence revealed 99% 
identity to an uncharacterised GC-rich sequence, 
identified as a CpG island according to the method 

25 described by Cross et al. (14). This sequence 

corresponds to position 988-1212 (Figure 5) which is in 
exon 1 of the human utrophin gene and correlates to our 
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previous findings that the 5' UTR of utrophin is 
untnethylated (9) . 

Sequence analysis reveals the absence of TATA and 
CAAT motifs common to eukaryotic promoters , and the 
5 presence of several potential binding sites for the 
transcription factor Spl. However, the Spl binding 
sites are not conserved between human and mouse and may 
indicate that these sites are not involved in the 
transcriptional regulation of utrophin or that there is 
10 no constraint for the position to be conserved between 
species . 

A consensus E-box element (CANNTG) , involved in 

» 

regulating muscle gene expression (17) , is conserved 
between both species (Figure 5) . A recently identified 
15 DNA element (TTCCGG) which appears to regulate synaptic 
expression of the mouse acetylcholine receptor 6- 
subunit is present in the 5' flanking sequence of both 
the human and mouse utrophin gene, although in 

■ 

different positions (Figure 5) . 

* 

» 

20 Identification of the putative transcription start site 

Different levels of utrophin expression were 
observed in human cell lines by RNase protection using 
a probe derived from the carboxy terminus. Utrophin 
was shown to be expressed at differing levels in the 

25 cervical cancer Hela cell line, adult kidney CL11T47 
cell line, adult myoblast primary culture and 
rhabdomyosarcoma IN157 cell line, with the highest 
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levels in IN157 and CL11T47. IN157 was the cell line 
used in the following assays to identify the start of 
human transcript* 

Primer Extension 
5 A 22-mer oligonucleotide (U25) with its 5' end 

starting at nucleotide 1088 in exon 1 of the human 
utrophin gene was used for primer extension assays with 
RNA isolated from Hela cells, CL11T47 kidney cells and 
rhabdomyosarcoma IN157 cells. The sizes of the bands 

10 generated by extension were I90bp and 13 8bp, which 

correspond to transcription initiation at nucleotides 
898 and 950 respectively on the human sequence (Figure 
5) . The larger band extends the start of transcription 
37bp upstream from the 5' end of the most 5' utrophin 

15 cDNA isolated. The larger product, representing an 

initiation site at nucleotide 898, was observed in all 
the cell lines analysed. However the smaller product 
was generated only in IN157 rhabdomyosarcoma cells and 
CL11T47 kidney cells. 

20 5' Race 

In order to characterise the 5' end of the human 

transcript, the oligonucleotide U25 r described for 

primer extension above, was used for cDNA synthesis and 

PCR amplification of RNA from the IN157 
25 rhabdomyosarcoma cell line. The amplified products 

were subcloned and hybridised with the 0.6kb cDNA 
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fragment containing the 5' end of utrophin. Sequence 
analysis of- the positive clones indicated a putative 

start site at position 907 on the human sequence 

<■■ 

(Figure 5) . This is 9bp downstream from the start site 
5 predicted from the major band generated in the primer 
extension assays. A 20-mer oligonucleotide (U71) with 
its 5' end corresponding to 170bp downstream from the 
start of exon 2 (within the untranslated region) was 
also used for RACE analysis with RNA from IN157 cells. 
10 The sequence of positive hybridising subcloned products 
predicted a putative initiation site at nucleotide 
1214. This may represent a more 3' transcription start 
site, although it does not correlate to the smaller 
product identified in the primer extension assays. 
15 The most 5-' site for transcription initiation of 

the human utrophin transcript predicts a putative cap 
site at nucleotide 898bp, which is 587bp upstream from 
the translation initiation codon, and estimates the 
size of exon 1 as 495bp. The identification of 
20 additional putative starts of transcription may 

represent multiple clustered cap sites for the utrophin 
transcript. The major transcription start sites of the 
utrophin mRNA lie between positions 898 to 916 of the 
human sequence (Figure 5) or positions 1604 to 1622 of 
25 the mouse sequence. 

RNase Protection 

Probably because of the very high GC content, we 
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experienced technical difficulties in applying RNase 
protection analysis across the 5' UTR of human 
utrophin, a problem also described in the analysis of 
the GC-rich promoter of the human Dp71 dystrophin 
5 isoform (16) . In order to try to circumvent this 

problem we used the mouse transcript, which is less GC 
rich, for RNase protection assays. A complementary RNA 
probe was synthesised encompassing the Smal-PstI 
fragment from positions 743-1051 (Figure 5) , which 

10 spans the putative cap site of the human transcript. 

Using an RNA probe complementary to sequence within the 
carboxy- terminal domain as a control for utrophin 
expression, we were able to demonstrate that the 
utrophin transcript was present at abundant levels in 

15 mouse lung, however using the Smal-PstI RNA probe at 
hybridisation temperatures of 42 # C and 50 *C we were 
unable to detect a protected product. A major band of 
approximately 130bp only started to appear at a 
hybridisation temperature of 55 *C and increased in 

20 intensity with increasing hybridisation temperatures. 
The protected fragment observed was reproducibly a 
broad band suggesting that the start of transcription 
may vary over several nucleotides. 

To demonstrate that this 130bp product was 

25 specific to utrophin, the Smal-PstI RNA probe was 

hybridised to RNA from mouse lung and liver and the 
protected fragment was detected at similar levels as 
observed using the carboxy- terminal probe. No 
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protected product was detected when hybridising the 
mouse Smal-PstI probe to RNA isolated from human 
rhabdomyosarcoma IN157 cells, indicating that the probe 
is specific to the mouse transcript. This protected 
5 band of approximately 130bp predicts the start of 

transcription to lie at approximately nucleotide 916 on 
the human sequence, which is 9bp and 18bp downstream 
from the most 5' cap site predicted by RACE and primer 
extension assays respectively for the human transcript 
10 (Figure 5). These findings suggest that a similar site 
for transcription initiation is utilised in human and 
mouse. It should be noted that the region of sequence 
encompassing the most 5' putative cap sites is highly 
conserved between the two species. 

15 RT-PCT 

In dystrophin, the 5' promoters regulate the 
tissue-specific expression of full-length isoforms, all 
of which have a unique first exon spliced to a common 
second exon. To test the extent the putative utrophin 
20 promoter utilises the first untranslated exon, RNA from 
mouse brain, skeletal muscle, small intestine, liver, 
lung, spleen, heart, kidney and eye were used as 
templates for cDNA synthesis and PCR amplification. 

Primers were designed to amplify a region across 
25 exon 1 and exon 2 thus eliminating the risk of false 
positives due to contamination by genomic DNA. 
Amplification using a forward primer (BFB) and reverse 
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primer (U71) with 5' ends complementary to position 
1230 in exon 1 and 172bp downstream of the start of 
exon 2, respectively, generated an expected product of 
283bp. A second pair of primers with 5' ends at 

5 position 894 in exon 1 (B2A) and 153bp from the start 
of exon 2 (U73) amplified the expected product of 
575bp. These expected products were observed in all 
mouse tissues tested- This suggests that the utrophin 
promoter described here is active in driving the 

10 expression of a transcript containing the untranslated 
first exon and the second exon, which has the 
translation initiation ATG, in all tissues tested. 

In addition, amplification with the primer 
starting at nucleotide 894 indicates that the mouse 

15 transcript extends as far upstream as that predicted 

from the human transcript. This extends the start site 
for the mouse transcript approximately 20bp upstream of 
that predicted from RNase protection assays and this 
may represent multiple 5' start sites for the mouse 

20 utrophin transcript, as observed for the human 
transcript. A comparatively lower level of 
amplification was observed for the more 5' forward 
primer (B2A) , which lies near the putative start of 
transcription, compared to primer (BFB) starting 

25 further downstream in exon 1. It is possible that this 
upstream primer is complementary to a region spanning 
the cap site and therefore the primer may only 
partially anneal to the 5' end of transcript and result 
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in the weak amplification observed for several PCR 
conditions. In support of this proposition, primers 
complementary to sequences further upstream than 
position 894 were unable to generate a product using a 
5 range of PCR conditions, indicating that there are no 
additional upstream cap sites. 



DISCUSSION 

Utrophin has a CpG island, in contrast to the 
muscle and brain promoters of the dystrophin gene which 
10 have no CpG island. We have isolated and characterised 
a CpG- rich genomic fragment which contains, the first 
exon and 5' flanking sequence of utrophin. The 
fragment is active in initiating transcription of a 
reporter gene in various cell lines, indicating that 
15 the utrophin promoter element may be conserved. A 

series of 5' -deleted fragments of the utrophin upstream 
flanking region were generated to determine the minimal 
promoter element. Provided the 155bp region containing 
the promoter element was intact, 5' deletions did not 
20 significantly alter the transcriptional activity in 
human rhabdomyosarcoma IN157 cells. Hence, the CpG- 
rich 155bp region characterised here may function as a 
basal promoter element, driving utrophin transcription 
in many cell types, and it may be the presence of 
25 specific transcription factors that regulate the level 
of utrophin expression. Future investigation into the 
transcriptional activity of these constructs in 
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regulatory DNA elements controlling utrophin 
expression. 

The utrophin promoter region has several putative 
5 Spl binding sites and is devoid of TATA or CAAT motifs. 
By primer extension, 5' RACE and RNase protection 
analysis, we have located several putative cap sites 
for the full-length utrophin transcript. Although we 
cannot rule out the possibility that these additional 
10 products arose due to premature termination by reverse 
transcriptase during primer extension and RACE 
analysis, these results would be consistent with the 
observation that CpG-rich TATA- less promoters of widely 
expressed genes usually contain several transcription 
15 initiation sites spread over a fairly large region, 
rather than at a single base position (9) . 

Genes with CpG-rich promoters were initially 
considered to express proteins with a housekeeping 
function in the cell. However, several genes devoid of 
20 TATA or CAAT motifs in their promoter regions have been 
shown to encode proteins that are highly regulated 
(17,22) . Expression of the acetylcholinesterase gene, 
which also has a CpG-rich promoter, is regulated during 
muscle cell differentiation and is localised 
25 specifically at the NMJ (18) . The typical "house- 
keeping" promoter of the Dp71 dystrophin isoform drives 
expression in specific cell types (16) . Utrophin, 
although expressed in all tissues, also appears to be 
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regulated in different cell types, for example there 
are relatively abundant levels in adult lung and higher 
levels in foetal muscle compared to adult skeletal 
muscle (19) . We have shown here that the utrophin 
5 transcript is detected at different levels in various 
human cell lines, Utrophin transcripts are also 
specifically localised during development, with initial 
accumulation in the neural tube and later becoming 
abundant at a variety of other sites such as the tendon 
10 primordia in the digits, the pituitary thyroid and 

adrenal glands, cardiac muscle, and the kidney and lung 
(20) . Taken together, these observations suggest that, 
although utrophin is expressed widely, there is also 
developmental- and tissue- specif ic regulation of 
15 expression in certain tissues. 

Several putative DNA motifs identified in the 5' 
flanking sequence may be involved in the control of 
utrophin muscle expression. We identified a conserved 
E-box, which is a binding site for helix- loop-helix 
20 proteins of the MyoDl family, including MyoDl myogenin, 
MRF4 and myf5. E-box motifs are found in the promoters 
of many muscle-specific genes, including the a, b and g 
acetylcholine receptor subunit genes. Given the co- 
localisation of utrophin with acetylcholine receptors 
25 at the NMJ, it would be of interest to determine 

whether myogenic factors regulate the expression of 
utrophin by interaction with this conserved E-box 
motif. The human and mouse utrophin 5' flanking region 
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contain the core sequence of the N box, an element 
shown to direct synapse-specific expression of the 
mouse acetylcholine receptor 6-subunit gene (15) . This 
TTCCGG motif restricts the expression of the d-subunit 
5 gene to the NMJ by enhancing expression at the endplate 
and acting as a silencer in extrajunctional areas. 
Sequences identical to this core sequence of the N box 
sure present in other AChR subunit genes and it is 
likely that this element regulates the synaptic 

10 expression of at least some of these genes (16) . The 
mRNA levels of N-CAM, 43K-rapsyn and s-laminin (21) 
were shown by in situ hybridisation to be concentrated 
at the synaptic sites. By sequence analysis, we have 
determined that the core sequence of the N box is 

15 present in the 5' flanking of b2-syntrophin, which is 
also localised specifically to the NMJ, however it is 
absent from the upstream sequence of al-syntrophin, 
which is expressed throughout the general sarcolemma. 
This suggests that there may be a general mechanism for 

20 selective transcription by synaptic nuclei and this may 
involve the interaction of a transcription factor (s) 
capable of recognising the N box sequence. 
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CLAIMS 

1. A nucleic acid isolate comprising a utrophin gene 
promoter. 

2. A nucleic acid isolate comprising a promoter which 
5 comprises a sequence of nucleotides selected from (i) 

that of the promoter shown in Figure 2 and (ii) that of 
the mouse promoter sequence shown in Figure 5 - 

3. A nucleic acid isolate comprising a promoter which 
comprises the sequence of nucleotides shown 5' to 

10 position 898 in Figure 2. 

4. A nucleic acid isolate comprising a promoter which 
comprises the sequence of nucleotides shown 5' to 
position 1603 of the mouse sequence shown in Figure 5- 

5. A nucleic acid isolate comprising a promoter which 
15 consists essentially of a fragment of the promoter 

whose sequence is shown in Figure 2. 

6. A nucleic acid isolate according to claim 5 
wherein the promoter comprises a CANNTG motif. 

7. A nucleic acid isolate according to claim 6 
20 wherein the motif is CAGGTG. 



8. A nucleic acid isolate according to any of claims 
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5 to 7 wherein the promoter comprises a TTCCGG motif . 

9. A nucleic acid isolate comprising a promoter which 
consists essentially of a fragment of the mouse 
promoter whose sequence is shown in Figure 5. 



5 10. A nucleic acid isolate according to claim 9 
wherein the promoter comprises a CANNTG motif. 

11. A nucleic acid isolate according to claim 10 
wherein the motif is CAGGTG. 



12. A nucleic acid isolate according to any of claims 
10 9 to 11 wherein the promoter comprises a TTCCGG motif. 

13. A nucleic acid isolate comprising a promoter which 
consists essentially of a sequence of nucleotides that 
is an allele, mutant or derivative, by way of addition, 
insertion, deletion or substitution of one or more 

15 nucleotides, of the promoter sequence shown in Figure 
2. 



14. A nucleic acid isolate comprising a promoter which 
consists essentially of a sequence of nucleotides that 
is an allele, mutant or derivative, by way of addition, 
20 insertion, deletion or substitution of one or more 

nucleotides, of the mouse promoter sequence shown in 
Figure 5 . 
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15 . A nucleic acid construct comprising the promoter 
of nucleic acid according to any of the preceding 
claims operably linked to a heterologous sequence. 



16. A nucleic acid construct according to claim 15 
5 wherein the heterologous sequence is a coding sequence 



17. A nucleic acid construct according to claim 16 
wherein the heterologous sequence encodes a reporter 
molecule . 



18. A host cell comprising a nucleic acid construct 
10 according to any of claims 15 to 17. 

19. A method comprising culturing a host cell 
according to claim 18 under conditions for 
transcription of said heterologous sequence from the 
promoter . 

15 20. A method according to claim 19 wherein the 

heterologous sequence is a coding sequence and the host 
cell is cultured under conditions for expression of the 
encoded peptide or polypeptide product. 



20 



21. A method according to claim 19 or claim 20 
comprising detection of transcription of the 
heterologous sequence or the encoded product. 
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22. A method of screening for ability of a substance 
to modulate activity of a utrophin promoter, the method 
comprising contacting an expression system containing a 
nucleic acid construct according to any of claims 15 to 

5 17 with a test or candidate substance and determining 
transcription of the heterologous sequence or 
expression of the encoded peptide or polypeptide 
product . 

23. A method according to claim 19 wherein the 

10 expression system comprises a host cell containing said 
nucleic acid construct. 

24. A method which comprises, following identification 
of a substance as having the ability to modulate 
activity of a utrophin promoter in accordance with a 

15 method according to claim 22 or claim 23, manufacture 
of the substance and/or use of the substance in 
manufacture or formulation of a composition. 
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Figure 1 
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Figure 2 

1 AAGCTTGGAA CAGGCTCTAT AACAGATGTG CGTTTGTGTG TGCATATTGG 

51 AAAACAGAAA AATAAGGTCA GCGCAAACAC TACTTGTAAT ACAAACTAAT 

101 GTAAGAGAAA ACTTCTTTTT CAATATTAAC AACGACACTA GAGAAATAAT 

151 GTAAAATAAA TATACGGTTG GAACTAGGGG TAAAAAAAAA ATCAGCAACG 

201 TCAGCAAACT GAGATGGGGT GAGTTGGAAG GCAGATTGGA ATTTATCTCT 

251 TAACAAAATA TCACCCTAAC TAGAGACCTG TTTTGCCTAA GGGGACGTGA 

Af-i 

301 CTCACATTTT CGGATAATCT GAATAAGGGG AATTGTGTCT GCTCGAGGCA 

351 TCCATTCTGG TTCGGTCTCC GGACTCCCGG CTCCCGGCAC GCACGGTTCA 

401 CTCTGGAGCG CGCGCCCCAG GCCAGCCAAG CGCCGAGCCG GGCTGCTGCG 

451 GGCTGGGAGG GOGCGCAGGG CCGGCGCTGA TTGACGGGGC GCGCAGTCAG 

501 GTOACTTGGG GCGCCAAGTT CCCGACGCGG TGGCCGCGGT GACGGCCGAG 

551 GCCCGGCAGA CGCTGACCCG GGAACGTAGT GGGGCTGATC TTCCGGAACA 

601 AAGTTGCTGG GCCGGCGGCG GCGGGGCGAG AGCGCCGAGG GGGAGCCGGA 

651 GCGCTGCAGA GGCGGGGGCC GGAGGGCTGG CGCTGATCTG CACCCTTCTC 

(rCZ | 

701 ATCTGGAGAG CGGACCCCTG GCTGCCCGGA GGCGAGCCCC TTCCCGGGGG 

SPl **<C 

751 GTGGGGGCGG CAACGCGCGA CCCAGCGGTC CTGCGCCCCA CCCTCCCTCC 

801 TCCGCCTCCA GCGCTCGGCT CCAACAAAGG GGCAGGCCCG CAGCGGGGAG 

851 GAGGAGGAGG AGCCGCCGAA GGAGCGAGCC TCTCTCGCGC ACAAAGTTGT 

901 GGAGTCGTTT TTCCTCGGAG CAGGGAAGCG GGCAfeCAGCA GCCGGCC GCG 

951 GGCTTTCTCC CGCCGAGGGG CGAGGAGGAG CCTCTGGCTC CAGAAGCCGA 

1001 TTGGGGAATC ACGGGGAGCG GCGCCCCCCT TCTTTTGGGT CATTTCTGCA 

1051 AACGGAAAAC TCTGTAGCGT TTGGCAAAGT TGGTGCCTGC GCGCCCCTTC 

1101 CAGGTTTGCG CTTTGACTGT TTTGTTTTTG GCGGAACTAC CAGGCAGGAA 

1151 GATTGCACAA GTAAGGGGCG TTTTCAGTCG GGTGTCAATT TCTCTTTCTT 

1201 TCTTTCTTTT TTTAAAATTT CGGTTCGTGT CTGCTTCTCC AAGCTT 
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Figure 3 
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Figure 5 
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BAD ORIGINAL 



INTERNATIONAL SEARCH REPORT 



Interns* t«i Applies* on No 

PCT/oB 96/09979 



A. CLASSIFICATION OF SUBJECT MATTER „ „ 

IPC 6 C12N15/12 C67K14/47 C12Q1/66 //C12N9/92.C12N9/38 



According to International Patent Oanincatioo (IPC) or to both national cUssification snd IPC 



a FIELDS SEARCHED 



Minimum documentation torched (dasnficatian system followed by classification symbols) 

IPC 6 C12N C07K C12Q 



to the extent chst such doctsnents ere included in the fields sesrehed 



sesreh terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Citation of docunent* with mdtcsnan. 



of the relevant psoases 



Kdevsnc id ctsnn no. 



HUH. MOL. GENET. , 
vol. 2, no. 11, 1993, 
pages 1765-1772, XPe9291G739 
PEARCE ET AL. : "The utrophin and 
dystrophin genes share similarities in 
genomic structure" 
see the whole document 



-/-- 



2,5,13 



m 



Further doenments ere listed in the 



of box C. 



□ 



arc toted in 



of ated documents 



"A* d ocum e nt defining the general cute of the art which is not 
considered to be of particular relevance 

*E* earlier document but published on or alter the interna&onaJ 
filing date 

"L" document which may throw doubts on priority dnm(i) or 
which is ated to establish the publication date of another 
atation or other speasl reason (as specified) 

'O* doc u men t referring to an oral disclosure, use, exhibition or 



later docmnent 
or priority date 
to 



after the international filing 
snd not m conflict with the application but 
the principle or theory underlying the 



document published prior to the internaoonat filing date but 
later than the priority date claimed 



'X* document of particular relevance; the daimed im 

cannot be conattdered novd or cannot be conadcrcd to 
involve an inventive step when the document is taken alone 

*Y" do cum e nt of particular relevance; the daimed invention 
cannot be conadcrcd to involve an inventive step when the 
document n co m bi n ed with one or more other such docu- 
ments, such co mb i n a t ion being obvious to s person dulled 
m the art, 

member of the 



9 aV* In i 1 1 m ■ ■! a 



patent family 



Dale of the actual completion of the international search 

13 August 1996 


Date of mailing of the international search report 

1 a 08. 96 


Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Palentlaan 2 
NL - 2210 HV Ri jtwi jt 
Td. ( + 31-70) 340-2040, Tk. 31 651 epo nl. 
Fsx <♦ 31-70) 340-3016 


Authorized officer 

Gac. G 



Fbn»PC!71SA/310<i 



shaM) (July im) 



page 1 of 2 





INTERNATIONAL SEARCH REPORT 


Imcror -iai Ape* 


tcaoon No 






PCT/uB 96/00979 


C^Contrauanon) DOCUMENTS CONSIDERED TO BE RELEVANT 


Catcgjory * 


Quacc of doomem. wrti mitntim, what tppnymte, o* Ox relevant ptsxtfa 


Relevant to claim No. 


Y 

X.P 


NATURE GENETICS, 
vol. 6, March 1994, 

paycS tjD-c'tH, ArCWUD/OlO/ 

CROSS ET AL.: "Purification of CpS 
islands using a methylated DNA binding 
column' 

see the whole document 
& DATABASE STRAND 

EHBL Empri AN : HS54B8R, 22 October 1995 
MAC DONALD ET AL.: 

see Sequence ml • £.01000 iiur cunv«* 1 son 
with seq. id. nr 1 and seq. id. nr 3). 




2,5.13 
2,5,13 


A 


DEV. DYNAMICS, 

vol. 198. December 1993, 

pages 254-264, XP088578141 

SCHOFIELD ET AL.: "Expression of the 

dystrophin -related protein (utropninj gene 

during mouse enfaryogenesis" 

see page 261 - page 262 




1-3,24 


A 


BIOTECH. BUSINESS NEWS, 

vol. 3, no. 72, 28 January 1994, 

page 15 XP000578922 

ONCOGENE SCI. CORPO. ET AL.: "Muscular 
dystrophy therapy sought" 
see abstract 




22-24 


A 


NATURE, 

vol. 360, 1992, 

pages 591-593, XP002010740 

TINSLEY ET AL.: 'Primary structure of 

dystrophin-related protein" 

see the whole document 




1 

• 



Form PCT/liA/Ui (omtiaonioe of maa* *Mt) \f91) 



page 2 of 2 



